N-Body Simulations Using Message Passsing Parallel Computers

نویسندگان

  • Ananth Grama
  • Vipin Kumar
  • Ahmed H. Sameh
چکیده

In this paper, we present new parallel formulations of the Barnes-Hut method for n-body simulations on message passing computers. These parallel formulations partition the domain eeciently incurring minimal communication overhead. This is in contrast to existing schemes that are based on sorting a large number of keys or on the use of global data structures. The new formulations are augmented by alternate communication strategies which serve to minimize communication overhead. The impact of these communication strategies is experimentally studied. We report on experimental results obtained from an astrophysical simulation on an nCUBE2 parallel computer. The n-body problem simulates the behavior of n particles, each of which is innuenced by every other particle during a time-step. An exact formulation of this problem therefore requires calculation of n 2 interactions between each pair of particles. Many approximate algorithms have been devised that reduce the complexity of this problem. Most of these algorithms are based on a hierarchical representation of the domain using a spatial tree data structure. The leaf nodes consist of aggregates of particles. Each node in the tree contains a series representation of the effect of the particles contained in the subtree rooted at the node. These representations are typically based on Taylor or Legendre polynomials. Interactions between nodes and particles are dictated by a multipole acceptance criteria (MAC). Diierent algorithms use diierent MAC. Selection of an appropriate MAC is critical to controlling the error in simulation. Methods in this class include those due to Appel, Barnes and Hut, and Greengard and Rokhlin (Fast Multi-pole Method). Even with the reduced complexity of these algorithms, the n-body problem takes a significant amount of time because of the large number of particles and time steps. EEcient parallel formulations therefore need to be developed. The Barnes-Hut method is one of the most popular methods due to its simplicity. Although its computational complexity of O(n log n) is more than that of the Fast Multipole Method (FMM), which is O(n))4], the associated constants are smaller for the Barnes-Hut method particularly for simulations in three dimensions. Furthermore, FMM uses a number of complicated data structures that make it diicult to program. However, FMM has proven error bounds unlike the Barnes-Hut method. Warren and Salmon 9] present a variant of the Barnes-Hut method that has a good worst-case error bound. In this paper, we describe parallel formulations of the Barnes-Hut method. The parallel formulations presented in …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Spatial Pyramid Match Kernel Algorithm for Object Recognition using a Cluster of Computers

This paper parallelizes the spatial pyramid match kernel (SPK) implementation. SPK is one of the most usable kernel methods, along with support vector machine classifier, with high accuracy in object recognition. MATLAB parallel computing toolbox has been used to parallelize SPK. In this implementation, MATLAB Message Passing Interface (MPI) functions and features included in the toolbox help u...

متن کامل

The Parallel Implementation of N - body

This dissertation studies issues critical to e cient N-body simulations on parallel computers. The N-body problem poses several challenges for distributed-memory implementation: adaptive distributed data structures, irregular data access patterns, and irregular and adaptive communication patterns. We introduce new techniques to maintain dynamic irregular data structures, to vectorize irregular ...

متن کامل

Scattering From Complex Geometries Using a Parallel FVTD Algorithm

A D explicit nite volume algorithm has been developed to simulate scattering from complex geometries on parallel computers using structured body conformal curvilinear grids Most simulations with realistic D ge ometries require a large number of grid points for adequate spatial resolution making them suitable to parallel computation The simulations have been carried out using a multi block zonal...

متن کامل

A domain decomposition parallel processing algorithm for molecular dynamics simulations of systems of arbitrary connectivity

We describe in this paper methods for applying domain decomposition to a general purpose molecular dynamics program. The algorithm is suitable for either distributed memory parallel computers or shared memory machines with message passing libraries. A method is discussed in detail which allows molecules of arbitrary connectivity to be simulated within the domain decomposition approach. The algo...

متن کامل

Toward Human-Level Massively-Parallel Neural Networks with Hodgkin-Huxley Neurons

This paper describes neural network algorithms and software that scale up to massively parallel computers. The neuron model used is the best available at this time, the Hodgkin-Huxley equations. Most massively parallel simulations use very simplified neuron models, which cannot accurately simulate biological neurons and the wide variety of neuron types. Using C++ and MPI we can scale these netw...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995